Simcoe County
Erase to Improve: Erasable Reinforcement Learning for Search-Augmented LLMs
Wang, Ziliang, An, Kang, Zheng, Xuhui, Qian, Faqiang, Zhang, Weikun, Ouyang, Cijun, Cai, Jialu, Wang, Yuhang, Wu, Yichao
While search-augmented large language models (LLMs) exhibit impressive capabilities, their reliability in complex multi-hop reasoning remains limited. This limitation arises from three fundamental challenges: decomposition errors, where tasks are incorrectly broken down; retrieval missing, where key evidence fails to be retrieved; and reasoning errors, where flawed logic propagates through the reasoning chain. A single failure in any of these stages can derail the final answer. We propose Erasable Reinforcement Learning (ERL), a novel framework that transforms fragile reasoning into a robust process. ERL explicitly identifies faulty steps, erases them, and regenerates reasoning in place, preventing defective logic from propagating through the reasoning chain. This targeted correction mechanism turns brittle reasoning into a more resilient process. Models trained with ERL, termed ESearch, achieve substantial improvements on HotpotQA, MuSiQue, 2Wiki, and Bamboogle, with the 3B model achieving +8.48% EM and +11.56% F1, and the 7B model achieving +5.38% EM and +7.22% F1 over previous state-of-the-art(SOTA) results. These findings suggest that erasable reinforcement learning provides a powerful paradigm shift for robust multi-step reasoning in LLMs.
- North America > United States > Massachusetts > Suffolk County > Boston (0.14)
- North America > United States > Massachusetts > Norfolk County > Wellesley (0.14)
- Europe > Poland > Masovia Province > Warsaw (0.06)
- (15 more...)
- Media (1.00)
- Leisure & Entertainment (1.00)
- Government > Voting & Elections (1.00)
- (2 more...)
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Do small language models generate realistic variable-quality fake news headlines?
McCutcheon, Austin, Brogly, Chris
Small language models (SLMs) have the capability for text generation and may potentially be used to generate falsified texts online. This study evaluates 14 SLMs (1.7B-14B parameters) including LLaMA, Gemma, Phi, SmolLM, Mistral, and Granite families in generating perceived low and high quality fake news headlines when explicitly prompted, and whether they appear to be similar to real-world news headlines. Using controlled prompt engineering, 24,000 headlines were generated across low-quality and high-quality deceptive categories. Existing machine learning and deep learning-based news headline quality detectors were then applied against these SLM-generated fake news headlines. SLMs demonstrated high compliance rates with minimal ethical resistance, though there were some occasional exceptions. Headline quality detection using established DistilBERT and bagging classifier models showed that quality misclassification was common, with detection accuracies only ranging from 35.2% to 63.5%. These findings suggest the following: tested SLMs generally are compliant in generating falsified headlines, although there are slight variations in ethical restraints, and the generated headlines did not closely resemble existing primarily human-written content on the web, given the low quality classification accuracy.
- North America > United States > Michigan (0.04)
- North America > Canada > Ontario > Simcoe County > Orillia (0.04)
- North America > United States > Montana (0.04)
- Africa > South Africa (0.04)
- Research Report > New Finding (1.00)
- Research Report > Experimental Study (1.00)
Did ChatGPT or Copilot use alter the style of internet news headlines? A time series regression analysis
Brogly, Chris, McElroy, Connor
The release of advanced Large Language Models (LLMs) such as ChatGPT and Copilot is changing the way text is created and may influence the content that we find on the web. This study investigated whether the release of these two popular LLMs coincided with a change in writing style in headlines and links on worldwide news websites. 175 NLP features were obtained for each text in a dataset of 451 million headlines/links. An interrupted time series analysis was applied for each of the 175 NLP features to evaluate whether there were any statistically significant sustained changes after the release dates of ChatGPT and/or Copilot. There were a total of 44 features that did not appear to have any significant sustained change after the release of ChatGPT/Copilot. A total of 91 other features did show significant change with ChatGPT and/or Copilot although significance with earlier control LLM release dates (GPT-1/2/3, Gopher) removed them from consideration. This initial analysis suggests these language models may have had a limited impact on the style of individual news headlines/links, with respect to only some NLP measures.
- North America > Canada > Ontario > Simcoe County > Orillia (0.04)
- North America > United States > Michigan (0.04)
- Research Report > Experimental Study (0.64)
- Research Report > New Finding (0.50)
Investigation of the Impact of Economic and Social Factors on Energy Demand through Natural Language Processing
Bai, Yun, Camal, Simon, Michiorri, Andrea
These authors contributed equally to this work. Abstract The relationship between energy demand and variables such as economic activity and weather is well established. However, this paper aims to explore the connection between energy demand and other social aspects, which receive little attention. Through the use of natural language processing on a large news corpus, we shed light on this important link. This study was carried out in five regions of the UK and Ireland and considers multiple horizons from 1 to 30 days. It also considers economic variables such as GDP, unemployment and inflation. We found that: 1) News about military conflicts, transportation, the global pandemic, regional economics, and the international energy market are related to electricity demand. Electricity demand modelling is a fundamental process in power system planning, operation, and energy trading [1]. In order to avoid additional carbon emissions from excess electricity generation and the high costs of electricity storage, electricity demand and supply should be matched over time [2]. Demand forecasting has become a means of enabling power dispatch, planning generation schedules, and integrating renewable energy sources [3]. Electricity demand forecasting is linked to various factors, including weather, economic activity, and major events.
- Europe > Ireland (0.26)
- Europe > United Kingdom > England > East Midlands (0.06)
- Europe > United Kingdom > Northern Ireland (0.06)
- (10 more...)
- Energy > Power Industry (1.00)
- Banking & Finance > Economy (1.00)
- Information Technology > Artificial Intelligence > Natural Language (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)
- Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (0.46)
Leveraging Compliant Tactile Perception for Haptic Blind Surface Reconstruction
Cheret, Laurent Yves Emile Ramos, da Fonseca, Vinicius Prado, de Oliveira, Thiago Eustaquio Alves
Non-flat surfaces pose difficulties for robots operating in unstructured environments. Reconstructions of uneven surfaces may only be partially possible due to non-compliant end-effectors and limitations on vision systems such as transparency, reflections, and occlusions. This study achieves blind surface reconstruction by harnessing the robotic manipulator's kinematic data and a compliant tactile sensing module, which incorporates inertial, magnetic, and pressure sensors. The module's flexibility enables us to estimate contact positions and surface normals by analyzing its deformation during interactions with unknown objects. While previous works collect only positional information, we include the local normals in a geometrical approach to estimate curvatures between adjacent contact points. These parameters then guide a spline-based patch generation, which allows us to recreate larger surfaces without an increase in complexity while reducing the time-consuming step of probing the surface. Experimental validation demonstrates that this approach outperforms an off-the-shelf vision system in estimation accuracy. Moreover, this compliant haptic method works effectively even when the manipulator's approach angle is not aligned with the surface normals, which is ideal for unknown non-flat surfaces.
- North America > Canada > Ontario > Simcoe County > Orillia (0.14)
- North America > Canada > Newfoundland and Labrador > Newfoundland > St. John's (0.14)
- North America > Canada > New Brunswick > Saint John County > Saint John (0.04)
- Africa > Central African Republic > Ombella-M'Poko > Bimbo (0.04)
Driverless cars: Researcher disguises himself as car seat in study
A study to test people's reactions to driverless cars has used a "ghost driver" to record their responses. The work, by the University of Nottingham, found that, in the absence of someone in the driving seat, pedestrians trust certain visual prompts more than others when deciding whether to cross the road. As part of the study, a car was driven around the university's campus over several days with its driver - research fellow David R. Large - concealed in the driver's seat. Mr Large, senior research fellow with the Human Factors Research Group at the university, said: "We wanted to explore how pedestrians would interact with a driverless car and developed this unique methodology to explore their reactions." Follow BBC East Midlands on Facebook, on Twitter, or on Instagram.
- North America > Canada > Ontario > Simcoe County > Midland (0.31)
- Europe > United Kingdom > England > Nottinghamshire > Nottingham (0.31)
- Europe > United Kingdom > England > East Midlands (0.31)
- Transportation > Passenger (0.94)
- Transportation > Ground > Road (0.94)
- Information Technology > Robotics & Automation (0.94)
Chatsworth's hidden 17th Century garden revealed in drone footage
A hidden 17th Century garden that emerged during a heatwave has been shown in new drone footage. The European-style formal garden at the Chatsworth Estate in Derbyshire was designed in 1699 for the 1st Duke of Devonshire. It was grassed over 30 years later but substantial remains lie buried under just a thin layer of soil and grass, which has since been parched by the recent dry weather. While the historic design will not be fully restored any time soon, Steve Porter - head of gardens and landscape at Chatsworth - said he hoped the old garden, known as the Great Parterre, could be recreated with gravel once the grass had recovered. "Every time you look you almost see more of the detail, more of the scrolls of the beds and more of the paths and it sort of brings it all back to life and you realise just how intricate and just how amazing it would have been," he added. Follow BBC East Midlands on Facebook, Twitter, or Instagram.
- North America > Canada > Ontario > Simcoe County > Midland (0.29)
- Europe > United Kingdom > England > East Midlands (0.29)
- Information Technology > Communications > Social Media (1.00)
- Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles > Drones (1.00)
Career Profile: Andrew E. Brereton - Computational Scientist
I was born/grew up in: I was born in Nova Scotia, but grew up in Parry Sound, Ontario. I now live in: I now live in Barrie, Ontario, and work remotely for a company headquartered in Toronto. I work now at a company called Cyclica. We are a biotechnology company that uses Artificial Intelligence (AI) to help make medicines that are more effective for patients. I do research and develop methods for computational drug design.
- North America > Canada > Ontario > Toronto (0.26)
- North America > Canada > Ontario > Simcoe County > Barrie (0.26)
- North America > Canada > Nova Scotia (0.26)
- North America > United States > Oregon (0.06)
Machine Learning Innovation in the Information, Communications and Technology Industries
Have you ever visited a favorite e-commerce website, and noticed the website was recommending some products you had looked at before? Have you taken a picture of your friend with your smartphone, and the phone asked you to confirm whether or not it was, in fact that friend by name, based on pictures you had tagged before? Have you heard about Google's self-driving car, Skype's emerging translation capabilities or IBM Watson, which is helping doctors to diagnose and treat patients more effectively? There are many names for the magic behind this wizardry, including artificial intelligence, machine learning or cognitive technology. Regardless of what you call it, robots, software and computing devices are evolving to become more autonomous than ever before.
Applied AI News
General Electric's Research and Elscint (Hackensack, NJ), a manufacturer Johnson Controls (Milwaukee, WI) Development Center (Schenectady, of medical imaging systems, has has begun deployment of a knowledge-based NY) has developed an expert system begun offering its customers a service engineering application which is being used to increase the option based on expert systems. The to increase the productivity of the speed of design of new jet engines, MasterMind system delivers troubleshooting engineering design function. The system, called Engineous, on laptop or desktop computers. The General (Menlo Park, CA), is conveyor for further processing. It problems and recommends solutions objects have become rotated.
- North America > United States > Wisconsin > Milwaukee County > Milwaukee (0.25)
- North America > United States > New Jersey > Bergen County > Hackensack (0.25)
- North America > United States > California > San Mateo County > Menlo Park (0.25)
- (8 more...)
- Industrial Conglomerates (0.56)
- Aerospace & Defense (0.51)
- Electrical Industrial Apparatus (0.36)
- Automobiles & Trucks (0.36)